This paper shows that an iterative process, with certain desirable
convergence properties, can be used to compute the solution of an
important general equation when certain conditions are met. More
specifically, let $f$ be a function from $U$ into $B$, where $B$ is a
Banach space and $U$ is a nonempty open subset of $B$. One main result
reported on is a proof of the existence of a superlinearly convergent
algorithm that globally converges to a solution $x$ of $f(x) = a$ for
each $a \in B$, whenever $f$ is a $C^1$-diffeomorphism of $U$ onto $B$,
and either $B = R^n$ or $f$ satisfies certain other conditions that are
frequently met in applications. For the case of an important class of
monotone diffeomorphisms $f$ in a Hilbert space $H$ (such maps arise,
for example, in signal-theory studies), the "other conditions" reduce
to simply the requirement that $f'$ (the Fréchet derivative of $f$) be
uniformly continuous on closed bounded subsets of $H$.

I. INTRODUCTION AND OUTLINE OF RESULTS 

Let $f$ be a function from $U$ into $B$, where $B$ is a Banach space
with norm $|\cdot|$ and $U$ is a nonempty open subset of $B$. We say
that $f$ is differentiable on a set $S \subset U$ if $f$ has a Fréchet
derivative $f'(s)$ at each point $s$ of $S$.† (If, for example,
$B = R^n$ with the usual Euclidean norm, then $f$ is differentiable on
$U$ if it is continuously differentiable on $U$ in the usual sense.) By
saying that $f$ is a $C^1$-diffeomorphism, we mean that $f$ is a
homeomorphism of $U$ onto $B$, and $f'$ and $(f^{-1})'$ exist and are
continuous on $U$ and $B$, respectively. (We emphasize that here
continuity refers to the dependence of the derivatives on the points at
which they are evaluated, not to their boundedness as operators, which
is assured by definition.‡)



* Results reported were part of an invited talk at the Workshop on Nonlinear Circuits 
and Systems of the 1980 IEEE International Symposium on Circuits and Systems, 
Houston, Texas, April 1980. 

† In other words, $f$ is differentiable on $S \subset U$ if for each
$s \in S$, there is a bounded linear map $f'(s): B \to B$ such that
$f(s + h) = f(s) + f'(s)h + o(|h|)$ for $(s + h) \in U$.

‡ And, of course, this continuity is with respect to the usual induced
norm of a bounded linear map of $B$ into $B$.
$C^1$-diffeomorphisms frequently arise in applications. One of the main
purposes of this paper is to show the existence of a superlinearly
convergent algorithm that globally converges to a solution $x$ of
$f(x) = a$ for each $a \in B$, whenever $f$ is a $C^1$-diffeomorphism
and either $B = R^n$ or $f$ satisfies certain additional conditions that
are frequently met in applications.

With $f$ not necessarily a $C^1$-diffeomorphism, suppose that $a \in B$
and an initial point $x^0 \in U$ are specified. Let $L$ denote
$\{u \in U : |f(u) - a| \le |f(x^0) - a|\}$, and assume that $f'$ exists
on $L$. Consider the following process (which, possibly, might not be
able to be carried out) for generating a sequence $x^1, x^2, \cdots$ of
points in $U$.

Process N: Define $s(v) = |f(v) - a|$ for $v \in U$. Choose real
numbers $\kappa$ and $\alpha$ such that $0 < \kappa < 1/2 < \alpha < 1$.
For $k = 0, 1, \cdots$ do the following.

If $f(x^k) = a$, set $x^{k+1} = x^k$.

If $f(x^k) \ne a$, determine $\phi_k \in B$ such that
$f'(x^k)\phi_k = a - f(x^k)$. Then choose $\gamma_k > 0$ such that
$(x^k + \gamma_k\phi_k) \in U$ and

$$\kappa \le \frac{s^2(x^k) - s^2(x^k + \gamma_k\phi_k)}{2\gamma_k s^2(x^k)} \le \alpha,$$

and take $\gamma_k = 1$ when possible. Set
$x^{k+1} = x^k + \gamma_k\phi_k$.

Since $x^{k+1} - x^k = \gamma_k\phi_k$ when $f(x^k) \ne a$, with
$\gamma_k$ a scalar and $\phi_k$ as indicated, Process N is a
Newton-direction process with a particular subprocess for determining
each steplength $\gamma_k|\phi_k|$. Of course, if $\gamma_k = 1$ for all
$k$, then the iterates $x^1, x^2, \cdots$ are the same as those produced
by Newton's method, assuming that each $f'(x^k)^{-1}$ exists.†
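
As a concrete illustration, the following Python sketch carries out a
Process N-style iteration for a scalar map on $U = R^1$. It is only a
sketch under stated assumptions: the names `process_n` and
`choose_step`, the values $\kappa = 0.25$ and $\alpha = 0.75$, the
stopping tolerance, and the bisection used to locate an admissible
steplength are choices made here for illustration; the paper does not
specify the one-dimensional search. The bisection uses two facts: the
quotient in the steplength condition tends to 1 as $\gamma \to 0$, and
it is at most 1/2 at $\gamma = 1$, so a rejected unit step always means
that the quotient lies below $\kappa$ there. The test map
$\mathrm{sign}(x)\log(1 + |x|)$ is a hypothetical example of a
$C^1$-diffeomorphism of $R^1$ onto $R^1$.

```python
import math


def choose_step(g, kappa, alpha, max_bisections=200):
    """Return gamma in (0, 1] with kappa <= g(gamma) <= alpha (gamma = 1 if possible)."""
    if kappa <= g(1.0) <= alpha:
        return 1.0
    # g(1) <= 1/2 < alpha always, so a rejected unit step means g(1) < kappa.
    # Invariant: g(hi) < kappa, and g > alpha at lo (or lo = 0, where g -> 1).
    lo, hi = 0.0, 1.0
    for _ in range(max_bisections):
        gamma = 0.5 * (lo + hi)
        value = g(gamma)
        if kappa <= value <= alpha:
            return gamma
        if value > alpha:
            lo = gamma
        else:
            hi = gamma
    raise RuntimeError("no admissible steplength found")


def process_n(f, fprime, a, x0, kappa=0.25, alpha=0.75, tol=1e-12, max_iter=100):
    """Damped Newton iteration patterned on Process N for scalar f on U = R."""
    assert 0.0 < kappa < 0.5 < alpha < 1.0
    x = x0
    for _ in range(max_iter):
        residual = f(x) - a
        if abs(residual) <= tol:          # treat f(x^k) = a up to a tolerance
            break
        phi = -residual / fprime(x)       # solves f'(x^k) phi_k = a - f(x^k)
        s2 = residual * residual          # s^2(x^k)

        def g(gamma, x=x, phi=phi, s2=s2):
            new_residual = f(x + gamma * phi) - a
            return (s2 - new_residual * new_residual) / (2.0 * gamma * s2)

        x = x + choose_step(g, kappa, alpha) * phi
    return x


def f(x):
    # Hypothetical test map: a C^1-diffeomorphism of R onto R with f(0) = 0.
    return math.copysign(math.log1p(abs(x)), x)


def fprime(x):
    return 1.0 / (1.0 + abs(x))


solution = process_n(f, fprime, a=0.0, x0=10.0)

# Undamped Newton steps from the same starting point, for comparison.
newton = 10.0
for _ in range(5):
    newton -= f(newton) / fprime(newton)
```

For this map the undamped Newton iterates started at $x^0 = 10$ move
away from the solution $x = 0$, while the damped iteration reaches it.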

All our results concern Process N and are given in Section II, where
we first prove two preliminary theorems. The first of these, Theorem
1, provides conditions on $f$, $a$, and $x^0$, under which Process N can
be carried out and any sequence $x^1, x^2, \cdots$ produced is a
sequence of eventually arbitrarily good approximate solutions of
$f(x) = a$, in the sense that $|f(x^k) - a| \to 0$ as $k \to \infty$.
Under the assumptions that the process can be carried out and that we
have $x^k \to x$ as $k \to \infty$, where $x$ satisfies $f(x) = a$,
Theorem 2 shows that $x^1, x^2, \cdots$ converges superlinearly in the
usual sense that $|x^{k+1} - x| = c_k|x^k - x|$, $k \ge 0$, where
$c_k \to 0$ as $k \to \infty$, provided merely that $(f')^{-1}$ exists
and is continuous in some neighborhood $N_x$ of $x$. Process N is
similar to an algorithm studied in Ref. 1, pp. 43-44, in connection with
mappings in $R^n$. Results related to Theorems 1 and 2 are given in
Ref. 1 where, however, the strong hypothesis that $f$ is twice
continuously differentiable plays a central role in the proofs, as does
the use of the sequential compactness of closed bounded sets in $R^n$.
(For a general $B \ne R^n$, closed bounded subsets of $B$ need not be
sequentially compact.) Newton-direction algorithms, which are sometimes
referred to as "damped Newton" methods, have been considered also by
other writers (see, for example, Ref. 2, which contains a good
bibliography).

† With regard to the convergence properties of Newton's method and the
results given in this paper, it is a simple matter to give an example
of a diffeomorphism $f$ of $R^1$ onto $R^1$ such that $f(0) = 0$ and for
some $x^0$ the Newton iterates for $a = 0$ do not converge.

Theorems 1 and 2 are used in Section II to prove three results
concerning $C^1$-diffeomorphisms and Process N. Theorem 3 asserts that,
whenever $f$ is a $C^1$-diffeomorphism with certain properties and $a$
and $x^0$ are given, Process N can be carried out, and any sequence
$x^1, x^2, \cdots$ generated converges superlinearly to the unique
solution $x$ of $f(x) = a$. Theorem 4 shows that the conclusion of
Theorem 3 holds for an important class of "monotone operators" which is
of interest, for example, in connection with studies of nonlinear
elliptic partial differential equations, demodulation of very wide band
frequency-modulated signals, recovery of distorted signals, and
nonlinear networks.

Our final result, Theorem 5, is a characterization of
$C^1$-diffeomorphisms in $R^n$; it asserts that a continuously
differentiable map $f: U \to R^n$ is a $C^1$-diffeomorphism if and only
if Process N can always be carried out, and for each $a$ always produces
a sequence of iterates that converges superlinearly to a solution $x$ of
$f(x) = a$ that does not depend on the initial point $x^0$. A similar
characterization in terms of a steepest descent process is given in
Ref. 3.†‡

Sections 2.2, 2.5, and 2.6.2 contain comments concerning Theorems 1, 3,
and 4, respectively. An example of an $f$ that meets the hypotheses of
Theorem 4, and for which $B$ is not finite dimensional, is given in
Section 2.6.3.

II. THEOREMS CONCERNING PROCESS N 

Recall that $f$, $U$, $B$, the set $L$, Process N, and the terms
"differentiable" and "$C^1$-diffeomorphism" are defined in Section I.
Throughout this section, $\theta$ denotes the zero element of $B$, and
if $A: B \to B$ is a bounded linear map, then $|A|$ denotes the usual
induced norm of $A$. By $f$ or $f'$ uniformly continuous on a set
$S \subset U$ is meant, as usual, that for each $\delta_1 > 0$ there is a
$\delta_2 > 0$ such that $u, v \in S$ with $|u - v| < \delta_2$ implies
that $|f(u) - f(v)| < \delta_1$ or $|f'(u) - f'(v)| < \delta_1$,
respectively.

† A related early result concerning the use of Newton-direction
algorithms in $R^n$, in which $f$ is assumed to be twice continuously
differentiable, is contained in the appendix of Ref. 4, and some recent
peripherally related material on Newton-direction algorithms in $R^n$
is given in Ref. 5.

‡ This writer is indebted to his colleague D. J. Rose, for discussions
concerning general aspects of numerical analysis, and for reawakening
this writer's interest in Newton-direction algorithms.

Theorem 1: Let $a \in B$ and $x^0 \in U$ be given. Assume that $L$ is
bounded, that $f$ is differentiable on $L$, that $f$ and $f'$ are
uniformly continuous on $L$, that $f'(u)^{-1}$ exists for each
$u \in L$, and that there is a constant $K$ such that
$|f'(u)^{-1}| \le K$ for $u \in L$. Then Process N can be carried out,
and any sequence $x^1, x^2, \cdots$ generated by the process satisfies
$|f(x^k) - a| \to 0$ as $k \to \infty$.

2. 1 Proof of Theorem 1 

Assume, for the purpose of the following inductive argument, that $k$
is an integer for which the process can be used to determine $x^j$ for
$j \le k$ and that $x^k \in L$, which is certainly true for $k = 0$. If
$f(x^k) = a$, then obviously $x^{k+1}$ can be determined and
$x^{k+1} \in L$.

Suppose now that $f(x^k) \ne a$. We have $x^k + \gamma\phi_k \in U$ for
sufficiently small $\gamma > 0$, because $U$ is open. Let
$g(x^k, \gamma)$ denote

$$\frac{s^2(x^k) - s^2(x^k + \gamma\phi_k)}{2\gamma s^2(x^k)}$$

whenever the latter is defined. Since $f$ is differentiable at $x^k$,
$s(x^k + h) = |f(x^k) - a + f'(x^k)h + \delta|$ for $(x^k + h) \in U$,
in which $|\delta|/|h| \to 0$ as $|h| \to 0$. Let
$h = -\gamma f'(x^k)^{-1}[f(x^k) - a]$, with $\gamma > 0$ such that
$(x^k + h) \in U$. We have
$s(x^k + \gamma\phi_k) = |(1 - \gamma)[f(x^k) - a] + \delta|$. Since
$|(1 - \gamma)[f(x^k) - a]| - |\delta| \le |(1 - \gamma)[f(x^k) - a] + \delta| \le |(1 - \gamma)[f(x^k) - a]| + |\delta|$,
clearly
$|(1 - \gamma)[f(x^k) - a] + \delta| = |(1 - \gamma)[f(x^k) - a]| + \eta|\delta|$
where $\eta \in [-1, 1]$ depends on $\gamma$. Thus, for sufficiently
small $\gamma > 0$,



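$$g(x^k, \gamma) - 1 = \frac{-\gamma^2|f(x^k) - a|^2 - 2(1 - \gamma)\eta|f(x^k) - a| \cdot |\delta| - \eta^2|\delta|^2}{2\gamma|f(x^k) - a|^2}. \qquad (1)$$

Since

$$\frac{|\delta|}{\gamma|f(x^k) - a|} = \frac{|\delta|}{|h|} \cdot \frac{|h|}{\gamma|f(x^k) - a|} \qquad (2)$$

and $|h|\gamma^{-1}|f(x^k) - a|^{-1} \le K$, we see that the left side of
(2) approaches zero as $\gamma \to 0$. Similarly,

$$\frac{|\delta|^2}{\gamma|f(x^k) - a|^2} = \frac{|\delta|^2}{|h|^2} \cdot \frac{|h|^2}{\gamma|f(x^k) - a|^2} \le \gamma K^2 \frac{|\delta|^2}{|h|^2}, \qquad (3)$$

which shows that the extreme left side of (3) approaches zero as
$\gamma \to 0$. Thus, using $|\eta| \le 1$, $g(x^k, \gamma) \to 1$ as
$\gamma \to 0$. We use this fact as follows.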

Let $S$ denote
$\{\gamma > 0 : x^k + \sigma\phi_k \in L \text{ for } \sigma \in [0, \gamma]\}$.
Notice that $S$ is not empty, because $g(x^k, \gamma) > 0$ for
sufficiently small $\gamma > 0$. Let $\gamma^* = \sup S$, and observe
that $\gamma^*$ is finite, since $L$ is bounded and
$\phi_k \ne \theta$. Also, $x^k + \gamma^*\phi_k \in L$ because, by the
continuity of $f$, $L$ is closed. It follows that $\gamma^* \in S$,
which requires that $g(x^k, \gamma^*) \le 0$ (because if
$g(x^k, \gamma^*) > 0$, we arrive at the contradiction that for some
$\lambda > 0$, $x^k + \sigma\phi_k \in L$ for
$\sigma \in [\gamma^*, \gamma^* + \lambda]$). Since $g(x^k, \gamma)$ is
defined and continuous for $\gamma \in (0, \gamma^*]$, and using
$g(x^k, \gamma^*) \le 0$ as well as $g(x^k, \gamma) \to 1$ as
$\gamma \to 0$, there is a $\gamma_k \in (0, \gamma^*)$ such that
$\kappa \le g(x^k, \gamma_k) \le \alpha$, and, of course, for that
$\gamma_k$, we have $x^{k+1} \in L$.

Thus, we can determine $x^{k+1}$ as required and $x^{k+1} \in L$. This
shows that the process can be carried out.

Now let $x^1, x^2, \cdots$ be any sequence produced by Process N.
Suppose, for the purpose of obtaining a contradiction, that
$|f(x^k) - a| \not\to 0$ as $k \to \infty$. We have
$|f(x^k) - a| \ne 0$ for all $k$. Observe that, since
$(1 - 2\kappa\gamma_k)s^2(x^k) \ge s^2(x^{k+1})$ for $k \ge 0$, the
existence of a positive constant $c$ such that $\gamma_k \ge c$ for all
$k$ implies that $|f(x^k) - a| \to 0$ as $k \to \infty$.
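
The implication just stated can be written out in one line. The
following is a short worked step added for the reader, using only the
lower bound $\kappa \le g(x^k, \gamma_k)$ imposed by Process N and the
definition of $g$; the constant $c$ is the hypothetical lower bound on
the $\gamma_k$.

```latex
% kappa <= g(x^k, gamma_k), rearranged:
s^2(x^{k+1}) \le (1 - 2\kappa\gamma_k)\, s^2(x^k), \qquad k \ge 0 .
% Since s^2(x^{k+1}) >= 0 and s^2(x^k) > 0, we have 2*kappa*gamma_k <= 1,
% so gamma_k >= c > 0 for all k gives 0 <= 1 - 2*kappa*c < 1 and
s^2(x^k) \le (1 - 2\kappa c)^k\, s^2(x^0) \longrightarrow 0
\quad \text{as } k \to \infty .
```

Consequently, if $|f(x^k) - a|$ does not tend to zero, the steplength
factors $\gamma_k$ cannot be bounded away from zero.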

For all $k$ we have $x^k \in L$, $|f(x^{k+1}) - a| \le |f(x^k) - a|$,
and $g(x^k, \gamma_k) \le \alpha$. Let $\{x^{k_i}\}$ be a subsequence of
$\{x^k\}$ such that $\gamma_{k_i} \to 0$ as $i \to \infty$. Since
$|f(x^k) - a|$ is monotone decreasing for increasing $k$ and
$|f(x^k) - a| \not\to 0$ as $k \to \infty$, there is a constant
$\rho > 0$ such that $|f(x^{k_i}) - a| \ge \rho$ for all $i$.

Let $h_i$ denote $-\gamma_{k_i}f'(x^{k_i})^{-1}[f(x^{k_i}) - a]$, and
notice that $|h_i| \le \gamma_{k_i}K|f(x^0) - a|$ for all $i$. Let
$\delta_i = f(x^{k_i} + h_i) - f(x^{k_i}) - f'(x^{k_i})h_i$ for each
$i$.

Lemma: Under the hypotheses of Theorem 1,
$\gamma_{k_i}^{-1}|\delta_i| \to 0$ as $i \to \infty$.

Proof: In the following, let $i \ge 2$, notice that
$|f(x^{k_i}) - a| \le |f(x^1) - a| < |f(x^0) - a|$, and let $\nu$
denote $\frac{1}{2}(|f(x^0) - a| - |f(x^1) - a|)$. By the uniform
continuity of $f$, choose $\delta > 0$ so that, if $x^a$ and $x^b$ are
in $L$ and $|x^a - x^b| < \delta$, then $|f(x^a) - f(x^b)| < \nu$.

Assume that $|h_i| < \delta$, which is satisfied for all sufficiently
large $i$. To justify the use of a certain integral representation
given below, we first establish that $x^{k_i} + \beta h_i \in L$ for
$\beta \in [0, 1]$: If it were not true that
$x^{k_i} + \beta h_i \in L$ for $\beta \in [0, 1]$, and since $x^{k_i}$
is an interior point of $L$, there would be a $\beta_i \in (0, 1)$ such
that $x^{k_i} + \beta_i h_i \in L$ and
$|f(x^{k_i} + \beta_i h_i) - a| = |f(x^0) - a|$. On the other hand,
$|f(x^{k_i} + \beta_i h_i) - a| \le |f(x^{k_i}) - a| + |f(x^{k_i} + \beta_i h_i) - f(x^{k_i})| \le |f(x^1) - a| + \nu < |f(x^0) - a|$,
which is a contradiction.

Therefore, we have

$$f(x^{k_i} + h_i) - f(x^{k_i}) = \int_0^1 f'(x^{k_i} + \beta h_i)\,d\beta \cdot h_i,$$

and so

$$|\delta_i| = \left|\int_0^1 [f'(x^{k_i} + \beta h_i) - f'(x^{k_i})]\,d\beta \cdot h_i\right|$$
$$\le \int_0^1 |f'(x^{k_i} + \beta h_i) - f'(x^{k_i})|\,d\beta \cdot |h_i|$$
$$\le \gamma_{k_i}K|f(x^0) - a| \sup_{\beta \in [0,1]} |f'(x^{k_i} + \beta h_i) - f'(x^{k_i})|.$$

By the uniform continuity of $f'$, and $|h_i| \to 0$ as $i \to \infty$,
we have $\gamma_{k_i}^{-1}|\delta_i| \to 0$ as $i \to \infty$, which
proves the lemma.
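With $g(x^k, \gamma)$ as defined earlier, we have (see the steps
leading to (1)),

$$g(x^{k_i}, \gamma_{k_i}) - 1 = -\frac{\gamma_{k_i}}{2} - \frac{(1 - \gamma_{k_i})\eta_i|\delta_i|}{\gamma_{k_i}|f(x^{k_i}) - a|} - \frac{\eta_i^2|\delta_i|^2}{2\gamma_{k_i}|f(x^{k_i}) - a|^2},$$

in which $|\eta_i| \le 1$, for all $i$. By the lemma, using the bound
$|f(x^{k_i}) - a| \ge \rho$, we have
$g(x^{k_i}, \gamma_{k_i}) \to 1$ as $i \to \infty$. This contradicts the
inequality $g(x^{k_i}, \gamma_{k_i}) \le \alpha$ for all $i$, and thus
$|f(x^k) - a| \to 0$ as $k \to \infty$.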

2.2 Comments 

With regard to the hypotheses of Theorem 1, a sometimes useful
sufficient condition for $f$ and $f'$ to be uniformly continuous on $L$
is simply that $f$ and $f'$ satisfy uniform Lipschitz conditions on
closed bounded subsets of $U$.

For the important special case in which $B = R^n$ and $|\cdot|$ is the
usual Euclidean norm, a relatively short proof can be given of the
proposition that, under the hypotheses of Theorem 1, we have
$|f(x^k) - a| \to 0$ as $k \to \infty$ for any sequence generated by
Process N: Suppose that $|f(x^k) - a| \not\to 0$ as $k \to \infty$. We
have $|f(x^k) - a| < |f(x^0) - a|$ for all $k > 0$. Since $\{x^k\}$ is
contained in the compact set $L$, we can construct a convergent
subsequence $\{x^{k_i}\}$ with limit $x^* \in L$ such that
$\lim_{i \to \infty}\gamma_{k_i} = 0$ and $f(x^*) \ne a$. Since $x^*$
satisfies $|f(x^*) - a| < |f(x^0) - a|$, $L$ contains an open
neighborhood of $x^*$, and we may therefore assume without loss of
generality that $\{x^{k_i}\}$ is contained in a convex subset of $L$.

By the mean-value theorem,

$$g(x^{k_i}, \gamma_{k_i}) = -\langle \nabla s^2(\xi_{k_i}), \phi_{k_i} \rangle / 2s^2(x^{k_i}),$$

in which $\langle \cdot, \cdot \rangle$ denotes the usual inner product,
$\nabla$ the gradient, and $\xi_{k_i}$ is a point on the open line
segment between $x^{k_i}$ and $x^{k_i} + \gamma_{k_i}\phi_{k_i}$. Thus,
with "tr" denoting the transpose,

$$g(x^{k_i}, \gamma_{k_i}) = \frac{2[f(\xi_{k_i}) - a]^{tr} f'(\xi_{k_i}) f'(x^{k_i})^{-1}[f(x^{k_i}) - a]}{2|f(x^{k_i}) - a|^2},$$

from which it is clear that $g(x^{k_i}, \gamma_{k_i}) \to 1$ as
$i \to \infty$, and this contradicts
$g(x^{k_i}, \gamma_{k_i}) \le \alpha$ for all $i$, which completes the
proof.

Theorem 2: Let $a \in B$ and $x^0 \in U$ be given. Assume that there is
an $x \in U$ such that $f(x) = a$, that Process N can be carried out,
and that it generates a sequence $x^1, x^2, \cdots$ such that
$f(x^k) \ne a$ for all $k$, and

1726 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1980



$x^k \to x$ as $k \to \infty$. Assume also that $f'$ and $(f')^{-1}$
exist in a neighborhood $N_x$ of $x$, and that $(f')^{-1}$ is continuous
at the point $x$. Then there is an integer $M$ such that
$\gamma_k = 1$ for $k \ge M$, and we have

$$|x^{k+1} - x|(|x^k - x|)^{-1} \to 0 \quad \text{as } k \to \infty.$$

2.3 Proof of Theorem 2 

We assume, without loss of generality, that $x^k \in N_x$ for all $k$.
Let $J$ and $J_k$ denote $f'(x)$ and $f'(x^k)$, respectively. Since
$\phi_k = J_k^{-1}(a - f(x^k)) = (J_k^{-1} - J^{-1})(a - f(x^k)) + J^{-1}(a - f(x^k))$,
we have
$|\phi_k| \le |J_k^{-1} - J^{-1}| \cdot |a - f(x^k)| + |J^{-1}| \cdot |a - f(x^k)|$
for all $k$. By the continuity of $(f')^{-1}$ at $x$ and
$f(x^k) \to a$ as $k \to \infty$, it is clear that
$\phi_k \to \theta$ as $k \to \infty$. Therefore, there is an integer
$m$ such that $x^k + \phi_k \in U$ for $k \ge m$. With regard to the
existence of $M$, observe that, since $\kappa < 1/2 < \alpha$, it
suffices to show that $g(x^k, 1) \to 1/2$ as $k \to \infty$ ($g$ is
defined in the proof of Theorem 1), or, equivalently, that $t(x^k, 1)$
given by $t(x^k, 1) = s(x^k + \phi_k)/s(x^k)$ for $k \ge m$, satisfies
$t(x^k, 1) \to 0$ as $k \to \infty$.
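
The equivalence asserted above follows directly from the definition of
$g$ with $\gamma = 1$; the short computation below is added here as a
worked step and uses no symbols beyond $g$, $s$, $t$, and $\phi_k$.

```latex
g(x^k, 1) = \frac{s^2(x^k) - s^2(x^k + \phi_k)}{2\, s^2(x^k)}
          = \frac{1}{2}\Bigl(1 - t(x^k, 1)^2\Bigr),
\qquad t(x^k, 1) = \frac{s(x^k + \phi_k)}{s(x^k)} \ge 0 .
% Hence g(x^k, 1) -> 1/2 if and only if t(x^k, 1) -> 0, and
% g(x^k, 1) <= 1/2 < alpha for every k for which it is defined.
```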

Let $k \ge m$. Since $f'$ exists at the point $x$,
$f(x^k) - a = J(x^k - x) + \delta$ and
$f(x^k + \phi_k) - a = J(x^k + \phi_k - x) + \Delta$, with
$\delta = o(|x^k - x|)$ and $\Delta = o(|x^k + \phi_k - x|)$, and of
course $\phi_k = -J_k^{-1}[J(x^k - x) + \delta]$. Since $J$ is an
invertible bounded linear operator from $B$ onto $B$, there are
positive constants $\beta_1$ and $\beta_2$ such that
$\beta_1|u| \le |Ju| \le \beta_2|u|$ for $u \in B$.†

Thus, for some $m' \ge m$, $|J(x^k - x)| \ge 2|\delta|$ and
$|J(x^k + \phi_k - x)| \ge |\Delta|$ for $k \ge m'$. We have, for
$k \ge m'$,
$t(x^k, 1) \le 4|J(x^k + \phi_k - x)| \cdot |J(x^k - x)|^{-1} \le 4\beta_2\beta_1^{-1}|x^k + \phi_k - x| \cdot |x^k - x|^{-1}$.
Therefore, for $k \ge m'$,

$$\tfrac{1}{4}\beta_1\beta_2^{-1}t(x^k, 1) \le |x^k - x|^{-1}\bigl(|x^k - x - (x^k - x) - J^{-1}\delta + (J^{-1} - J_k^{-1})[J(x^k - x) + \delta]|\bigr)$$
$$\le |x^k - x|^{-1}\bigl(|J^{-1}\delta| + |J^{-1} - J_k^{-1}| \cdot |J(x^k - x)| + |J^{-1} - J_k^{-1}| \cdot |\delta|\bigr). \qquad (4)$$

Using $|J^{-1}\delta| \le \beta_1^{-1}|\delta|$, as well as
$\delta = o(|x^k - x|)$ and $|J^{-1} - J_k^{-1}| \to 0$ as
$k \to \infty$, we see that $t(x^k, 1) \to 0$ as $k \to \infty$, which
proves the existence of $M$ such that $x^{k+1} = x^k + \phi_k$ for
$k \ge M$. Finally, notice that the middle member of (4) is
$|x^{k+1} - x| \cdot |x^k - x|^{-1}$ for $k \ge M$, and that the right
side of (4) approaches zero. Thus, our proof shows that
$|x^{k+1} - x| \cdot |x^k - x|^{-1} \to 0$ as $k \to \infty$, which
completes the proof of Theorem 2.

† The left inequality is a consequence of the proposition, due to
Banach, that if $A: B \to B$ is a bounded linear invertible operator,
then $A^{-1}$ is bounded.

Theorem 3: Assume that $f$ is a $C^1$-diffeomorphism. Let $f$ and $f'$
be uniformly continuous on closed bounded subsets of $U$, and let
$|(f^{-1})'|$ be bounded on closed bounded subsets of $B$. Then for
each $a \in B$ and each $x^0 \in U$, Process N can be carried out; any
sequence $x^1, x^2, \cdots$ generated by the process converges to the
unique solution $x$ of $f(x) = a$, and is such that we have
$|x^{k+1} - x| = c_k|x^k - x|$ for $k \ge 0$, in which $c_k \to 0$ as
$k \to \infty$.

2.4 Proof of Theorem 3 

Let $a$ and $x^0$ be given. Let $\rho$ be a constant such that
$|(f^{-1})'(u)| \le \rho$ for
$u \in \{u \in B : |u| \le |a| + |f(x^0) - a|\}$. If $v \in L$, then
$f(v) = a + b$ with $|b| \le |f(x^0) - a|$. For any such $b$,

$$f^{-1}(a + b) = f^{-1}(a) + \int_0^1 (f^{-1})'(a + \beta b)\,d\beta \cdot b$$

and thus $|f^{-1}(a + b)| \le |f^{-1}(a)| + \rho|f(x^0) - a|$, which
shows that $L$ is bounded.

Since $f$ is a $C^1$-diffeomorphism, $u = f^{-1}[f(u)]$ for $u \in U$
and $v = f[f^{-1}(v)]$ for $v \in B$. By the continuous
differentiability of $f$ and $f^{-1}$, we have, with $I$ the identity
operator on $B$, $I = (f^{-1})'(f(u)) \cdot f'(u)$ for $u \in U$, and
$I = f'[f^{-1}(v)] \cdot (f^{-1})'(v)$ for $v \in B$. Now set
$v = f(u)$, and observe that, for $u \in U$,
$I = (f^{-1})'(f(u)) \cdot f'(u)$ and
$I = f'(u) \cdot (f^{-1})'(f(u))$. Thus $f'(u)^{-1}$ exists, because
$f'(u)$ possesses both a left inverse and a right inverse, and it
satisfies $f'(u)^{-1} = (f^{-1})'(f(u))$ for $u \in U$. Let $K$ satisfy
$|(f^{-1})'(w)| \le K$ for
$w \in \{w \in B : |w| \le |a| + |f(x^0) - a|\}$. Choose $v \in L$, and
notice that $|f'(v)^{-1}| \le K$ because
$|f(v)| \le |a| + |f(x^0) - a|$.

By Theorem 1, Process N can be carried out, and any sequence that it
generates satisfies $|f(x^k) - a| \to 0$ as $k \to \infty$. Let
$x^1, x^2, \cdots$ be any such sequence.

Clearly, $x^k = f^{-1}(a + \delta_k)$ with $|\delta_k| \to 0$ as
$k \to \infty$. Since $f^{-1}$ is continuous, $x^k \to x$ as
$k \to \infty$, where $x$ is the unique solution of $f(x) = a$.

If $x^0 \ne x$, then $x$ is an interior point of $L$, and is therefore
contained in some neighborhood $N_x$ of $x$ such that
$N_x \subset L$. For $w \in N_x$, we have
$|f'(w)^{-1} - f'(x)^{-1}| = |f'(w)^{-1}[f'(x) - f'(w)]f'(x)^{-1}| \le K^2|f'(x) - f'(w)|$,
which shows that $(f')^{-1}$ is continuous at $x$. Thus, by Theorem 2
and the observation that if $x^j = x$ for some $j$, then $x^k = x$ for
$k \ge j$, there is a sequence $c_0, c_1, \cdots$ with the properties
stated in Theorem 3.† This completes the proof.

† In this connection, from Theorem 2 and known local properties of
Newton's method, it follows that we have quadratic convergence (i.e.,
$|x^{k+1} - x| \le \beta|x^k - x|^2$ for $k \ge 0$ and some constant
$\beta$) if in addition $f'$ is locally Lipschitzian on $U$.

2.5 Comment

Here we give a related result concerning conditions under which $f$ is
a $C^1$-diffeomorphism, which is sometimes useful. Assume that $f'$
exists and is continuous on $U$, and that $U$ is convex. Let
$\Lambda_1$ and $\Lambda_2$ be any two families of compact subsets of
$B$ and $U$, respectively, such that if $C$ and $D$ are compact sets in
$U$ and $B$, respectively, then there are $S \in \Lambda_1$ and
$T \in \Lambda_2$ such that $C \subset T$ and $D \subset S$.

If $f$ is a $C^1$-diffeomorphism, we have, as in the proof of Theorem
3, $I = (f^{-1})'(f(u)) \cdot f'(u)$ and
$I = f'(u) \cdot (f^{-1})'(f(u))$ for $u \in U$, and thus
$f'(u)^{-1}$ exists for $u \in U$. This observation, together with
Theorem 3 of Ref. 6, and the implicit function theorem in Ref. 7, p.
273, show that $f$ is a $C^1$-diffeomorphism if and only if
$f'(u)^{-1}$ exists for each $u \in U$, and for each $S \in \Lambda_1$,
there is a $T \in \Lambda_2$ such that $f(s) \in S$ implies that
$s \in T$. A corresponding result for $B = U = R^n$ is well known
(Ref. 8).

2.6 Monotone diffeomorphisms In Hilbert space 

Let $\psi: [0, \infty) \to [0, \infty)$ be continuous, strictly
increasing, and such that $\psi(0) = 0$, $\psi(\alpha) \to \infty$ as
$\alpha \to \infty$, and $\alpha^{-1}\psi(\alpha) \ge c$ for
$\alpha \in (0, \sigma)$ for some positive constants $c$ and $\sigma$.
Notice that, for example, $\psi(\alpha) = \alpha$ meets these
conditions.

Theorem 4: Let $f$ map a real Hilbert space $H$, with inner product
$\langle \cdot, \cdot \rangle$, into itself such that
$\langle f(u) - f(v), u - v \rangle \ge |u - v|\psi(|u - v|)$ for all
$u, v \in H$. Assume that $f'$ exists and is uniformly continuous on
closed bounded subsets of $H$. Then $f$ is a $C^1$-diffeomorphism of
$H$ onto $H$, and the conclusion of Theorem 3 (with $U$ and $B$
replaced with $H$) holds.

2.6. 1 Proof of Theorem 4 

Since $f$ is continuous and satisfies the indicated inequality with
$\psi: [0, \infty) \to [0, \infty)$ continuous, strictly increasing,
and such that $\psi(0) = 0$ and $\psi(\alpha) \to \infty$ as
$\alpha \to \infty$, it can be shown that $f$ is a homeomorphism of $H$
onto $H$ (Ref. 9; see also Refs. 10 and 11 for early results along
similar lines).

Now let $h \in H$ with $h \ne 0$ be arbitrary, and let $v$ be any point
in $H$. Since
$\langle f(v + th) - f(v), th \rangle \ge |th|\psi(|th|)$ for each real
number $t$, it follows from the existence of $f'$, and the inequality
$\alpha^{-1}\psi(\alpha) \ge c$ for $\alpha \in (0, \sigma)$, that
$|h|^{-2}\langle f'(v)h, h \rangle \ge c$. This implies that
$f'(v)^{-1}$ exists and that $|f'(v)^{-1}| \le c^{-1}$ for $v \in H$
(Ref. 12). Since $f$ is onto, by the inverse function theorem in Ref.
7, p. 273, $f^{-1}$ is continuously differentiable on $H$. Therefore,
$f$ is a $C^1$-diffeomorphism of $H$ onto $H$.

As in the proof of Theorem 3, $(f^{-1})'(f(u)) = f'(u)^{-1}$ for
$u \in H$, which shows that $|(f^{-1})'|$ is bounded on $H$. Finally,
by the uniform continuity of $f'$, $|f'|$ is bounded on convex closed
bounded sets, and since any bounded set in $H$ is contained in some
convex closed bounded subset of $H$, it easily follows that $f$ is
uniformly continuous on closed bounded sets.† Therefore, by Theorem 3,
the proof is complete.

† This type of argument shows also that the uniform continuity
requirement on $f$ in Theorem 3 can be dropped when $U = B$.
2.6.2 Notes

An early related result is the following. If $H$ is a real or complex
Hilbert space, and $f: H \to H$ is Lipschitz as well as uniformly
monotone in the sense that for some constant $c > 0$, we have

$$\mathrm{Re}\langle f(u) - f(v), u - v \rangle \ge c|u - v|^2 \quad \text{for } u, v \in H,$$

then $f$ is a homeomorphism of $H$ onto $H$, and, with $\lambda$ the
Lipschitz constant, given $a \in H$, and any starting point
$x^0 \in H$, the iterates $x^1, x^2, \cdots$ defined by

$$x^{k+1} = x^k + c\lambda^{-2}(a - f(x^k)), \quad k \ge 0$$

converge to the solution $x$ of $f(x) = a$ (Ref. 12).† A similar
proposition is given in Ref. 13, p. 131, and a modification of the
proposition described above, of use in Banach spaces, is given in Ref.
14.
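
The fixed-step iteration described above is easy to try numerically.
The Python sketch below is an illustration only, in the one-dimensional
Hilbert space $H = R^1$: the function name `monotone_iteration`, the
test map $2x + \sin x$ (for which $c = 1$ and $\lambda = 3$ are valid
constants, since its derivative lies in $[1, 3]$), the right-hand side,
the starting point, and the iteration count are all assumptions made
here.

```python
import math


def monotone_iteration(f, a, x0, c, lam, iterations=300):
    """Iterate x^{k+1} = x^k + c * lam**-2 * (a - f(x^k)) a fixed number of times."""
    x = x0
    step = c / lam ** 2
    for _ in range(iterations):
        x = x + step * (a - f(x))
    return x


def f(x):
    # Hypothetical test map: f'(x) = 2 + cos(x) lies in [1, 3],
    # so f is uniformly monotone with c = 1 and Lipschitz with lam = 3.
    return 2.0 * x + math.sin(x)


x = monotone_iteration(f, a=1.0, x0=5.0, c=1.0, lam=3.0)
residual = abs(f(x) - 1.0)
```

The convergence here is only linear, in contrast with the superlinear
convergence established for Process N.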

2.6.3 An example 

Here we give a simple example of a uniformly monotone operator for
which $H$ is not finite dimensional and the hypotheses of Theorem 4 are
met. The example arises in connection with results concerning the
recovery of distorted signals (see Ref. 11 and the work of Beurling,
Landau and Miranker, and Zames cited there).

Let $L_2$ denote the linear space of square-integrable functions from
$R^1$ into $R^1$, let the inner product $\langle \cdot, \cdot \rangle$
in $L_2$ be given, as usual, by

$$\langle u, v \rangle = \int_{-\infty}^{\infty} u(t)v(t)\,dt,$$

and let $|\cdot|$ denote the corresponding norm. Let $H$ be the closed
subspace of elements $v$ of $L_2$ for which $V(\omega)$, the Fourier
transform of $v$, given by

$$V(\omega) = \mathrm{l.i.m.} \int_{-\infty}^{\infty} v(t)e^{-i\omega t}\,dt,$$

vanishes for almost all $\omega \notin \Omega$, where $\Omega$ is a
fixed bounded interval $[-\omega_0, \omega_0]$. Of course, for any
$v \in H$ we have

$$v(t) = \frac{1}{2\pi}\int_{-\omega_0}^{\omega_0} V(\omega)e^{i\omega t}\,d\omega, \quad \text{a.e.} \qquad (5)$$

and we shall use the fact that

$$\operatorname*{ess\,sup}_t |v(t)| \le \left(\frac{\omega_0}{\pi}\right)^{1/2}|v|, \quad v \in H, \qquad (6)$$

which easily follows from (5), the Schwarz inequality, and Plancherel's
identity.

† For a particularly interesting application, see Ref. 15.
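
Inequality (6) can be checked in three short steps. The derivation
below is added as a worked computation; it assumes the Fourier
transform convention stated above, for which Plancherel's identity
reads $\int |V(\omega)|^2\,d\omega = 2\pi|v|^2$.

```latex
|v(t)| \le \frac{1}{2\pi}\int_{-\omega_0}^{\omega_0} |V(\omega)|\,d\omega
  % from (5)
\le \frac{1}{2\pi}\,(2\omega_0)^{1/2}
     \Bigl(\int_{-\omega_0}^{\omega_0} |V(\omega)|^2\,d\omega\Bigr)^{1/2}
  % Schwarz inequality
= \frac{1}{2\pi}\,(2\omega_0)^{1/2}\,(2\pi)^{1/2}\,|v|
  % Plancherel's identity
= \Bigl(\frac{\omega_0}{\pi}\Bigr)^{1/2} |v| .
```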

Let $P: L_2 \to L_2$ denote the projection operator that projects
$L_2$ onto $H$, and let $Q: H \to L_2$ be defined by the condition that
for each $v \in H$, $(Qv)(t) = q[v(t)]$ for almost all $t$, where
$q: R^1 \to R^1$ is continuously differentiable, such that $q(0) = 0$,
and such that there are positive constants $c$ and $\lambda$ with the
property that $c \le q'(\alpha) \le \lambda$ for all $\alpha$. Finally,
let $f: H \to H$ be given by $f = PQ$.

It can easily be verified that
$\langle Q(u) - Q(v), u - v \rangle \ge c|u - v|^2$ for $u, v \in H$.
Since $P$ is self-adjoint,
$\langle f(u) - f(v), u - v \rangle \ge c|u - v|^2$ for $u$ and $v$ in
$H$. Similarly, using $|P| = 1$, we have
$|f(u) - f(v)| \le \lambda|u - v|$ for $u, v \in H$.

For each $v \in H$, let $R_v: H \to L_2$ be defined by
$(R_v u)(t) = q'[v(t)]u(t)$ for almost all $t$ and any $u \in H$. (The
definition makes sense because $q'[v(\cdot)]$ is measurable, and we
have $\operatorname{ess\,sup}_t |q'[v(t)]| < \infty$.)

Now let $v \in H$ be arbitrary but fixed, and let $\delta(h) \in H$ be
given by $\delta(h) = (PQ)(v + h) - (PQ)(v) - (PR_v)h$ for each
$h \in H$. Using $|P| = 1$, we have
$|\delta(h)| \le |Q(v + h) - Qv - R_v h|$, and thus
$|\delta(h)| \le \rho(h)|h|$, where

$$\rho(h) = \operatorname*{ess\,sup}_t \int_0^1 |q'[v(t) + \beta h(t)] - q'[v(t)]|\,d\beta.$$

Since $q'$ is uniformly continuous on compact sets,
$\operatorname{ess\,sup}_t |v(t)| < \infty$, and
$\operatorname{ess\,sup}_t |h(t)| \to 0$ as $|h| \to 0$ [see (6)], we
have $\rho(h) \to 0$ as $|h| \to 0$. This shows that $f'(v)$ exists and
that $f'(v) = PR_v$.

On the other hand,
$|PR_{v_a} - PR_{v_b}| \le \sup\{|(R_{v_a} - R_{v_b})w| : w \in H, |w| = 1\}$
for $v_a$ and $v_b$ in $H$, where $|PR_{v_a} - PR_{v_b}|$ denotes the
induced norm
$\sup\{|(PR_{v_a} - PR_{v_b})w| : w \in H, |w| = 1\}$ of
$(PR_{v_a} - PR_{v_b}): H \to H$. Thus,

$$|PR_{v_a} - PR_{v_b}| \le \operatorname*{ess\,sup}_t |q'[v_a(t)] - q'[v_b(t)]|$$

for $v_a, v_b \in H$. Now let $\gamma$ be an arbitrary positive
constant, and suppose that $v_a$ and $v_b$ in $H$ satisfy
$|v_a| \le \gamma$ and $|v_b| \le \gamma$. Let $\sigma > 0$ be another
constant. Using (6),
$\operatorname{ess\,sup}_t \max(|v_a(t)|, |v_b(t)|) \le K$ where
$K = (\omega_0/\pi)^{1/2}\gamma$. By the uniform continuity of
$q'(\cdot)$ for $|\alpha| \le K$, let $\delta$ be such that
$|q'(\alpha_1) - q'(\alpha_2)| \le \sigma$ when $|\alpha_1| \le K$,
$|\alpha_2| \le K$, and $|\alpha_1 - \alpha_2| \le \delta$. Thus, by
(6), if $|v_a - v_b| \le \delta(\omega_0/\pi)^{-1/2}$, then
$|PR_{v_a} - PR_{v_b}| \le \sigma$. Since $\sigma$ is arbitrary, $f'$
is uniformly continuous on bounded subsets of $H$. This shows that $f$
satisfies the hypotheses of Theorem 4 (and it proves that there is a
superlinearly convergent algorithm that recovers bandlimited signals
that are nonlinearly distorted by $Q$ and subsequently bandlimited by
$P$).
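
To make the example above more tangible, the following Python sketch
sets up a finite, discrete stand-in for it. Everything specific in the
sketch is an assumption made for illustration: signals are vectors of
$n = 32$ samples, $P$ is replaced by a low-pass projection built from a
discrete Fourier transform, the distortion is the hypothetical choice
$q(\alpha) = \alpha + 0.5\tanh\alpha$ (so that $c = 1$ and
$\lambda = 1.5$ are valid constants), and the recovery uses the simple
fixed-step iteration of Section 2.6.2 rather than Process N, so it
illustrates the operator $f = PQ$ but not superlinear convergence.

```python
import cmath
import math


def dft(x):
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(X):
    n = len(X)
    return [sum(X[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


def project(x, m):
    """Discrete stand-in for P: keep only DFT bins with frequency index <= m."""
    n = len(x)
    X = dft(x)
    kept = [X[k] if min(k, n - k) <= m else 0.0 for k in range(n)]
    return [z.real for z in idft(kept)]


def q(s):
    # Hypothetical distortion with q(0) = 0 and 1 <= q'(s) <= 1.5.
    return s + 0.5 * math.tanh(s)


def f(v, m):
    """Discrete analogue of f = PQ: distort pointwise, then band-limit."""
    return project([q(s) for s in v], m)


def recover(a, m, c=1.0, lam=1.5, iterations=100):
    """Fixed-step iteration x <- x + c * lam**-2 * (a - f(x)), started at zero."""
    x = [0.0] * len(a)
    step = c / lam ** 2
    for _ in range(iterations):
        fx = f(x, m)
        x = [xi + step * (ai - fi) for xi, ai, fi in zip(x, a, fx)]
    return x


n, m = 32, 3
v_true = [math.cos(2 * math.pi * t / n) + 0.5 * math.sin(4 * math.pi * t / n)
          for t in range(n)]
a = f(v_true, m)              # distorted, then band-limited observation
v_rec = recover(a, m)
err = max(abs(p - r) for p, r in zip(v_true, v_rec))
```

In this discrete setting the same monotonicity and Lipschitz bounds
hold with the Euclidean inner product, which is why the fixed-step
iteration recovers the band-limited vector `v_true` from `a`.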

Theorem 5: Let $B = R^n$, and let $f$ be a continuously differentiable
map of $U$ into $R^n$. Then $f$ is a $C^1$-diffeomorphism of $U$ onto
$R^n$ if and only if

(i) Process N can be carried out for each $a \in R^n$ and each
    $x^0 \in U$.
(ii) For each $a$, any sequence produced by the process converges
    superlinearly to a solution $x$ of $f(x) = a$, and $x$ does not
    depend on $x^0$.

2.7 Proof 

If $f$ is a $C^1$-diffeomorphism, then $f$, $f'$, and $(f^{-1})'$ are
continuous, and then, since closed bounded subsets of $R^n$ are
compact, $f$ and $f'$ are uniformly continuous, and $|(f^{-1})'|$ is
bounded, on closed bounded sets. Thus, by Theorem 3, (i) and (ii) hold
when $f$ is a $C^1$-diffeomorphism.

On the other hand, suppose that (i) and (ii) are satisfied. Then
$f'(u)^{-1}$ exists for each $u$, because otherwise there would be an
$x^0$ and an $a$ such that $a \ne f(x^0)$ and such that there is no
solution $\phi_0$ of $f'(x^0)\phi_0 = a - f(x^0)$. Also, $f$ is
one-to-one and onto. Since there is an $x$ such that $f(x) = a$ for
each $a \in R^n$, and $f'(u)^{-1}$ exists for all $u$, it follows from a
standard inverse function theorem (see, for example, Ref. 7, p. 273)
that $(f^{-1})'$ exists and is continuous on $R^n$. Thus, (i) and (ii)
imply that $f$ is a $C^1$-diffeomorphism.

2.8 Comments 

Our concern throughout this paper is primarily with studies of what
is possible. We do not claim that of all applicable Newton-direction
algorithms, Process N is the most efficient. In fact, it is not
difficult to modify Process N to improve its performance in certain
specific cases. Also, results along the lines of this paper can be
proved using Newton-direction algorithms that, unlike Process N, do not
typically require, for the determination of the $\gamma_k$, a
one-dimensional search procedure for a finite number of values of $k$.
These results will be reported on in later papers (Refs. 16 and 17).